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Introduction. Here I digress from work in progress—namely, a review of a 
paper by C. Y. She & H. Heffner 1 , which was the first of several papers inspired 
by E. Arthurs & J. L. Kelly’s “On the simultaneous measurement of a pair of 
conjugate observables” (BSTJ 44, 725 (1965)); it is my intention to incorporate 
that material into an account of generalized quantum measurement theory on 
which I was at work two digressions ago—to sharpen my understanding of 
some elementary material that originated in Dirac’s treatment of the theory 
of quantum oscillators, but which acquired new interest and was carried to a 
higher state of development when Roy Glauber 2 and others laid the foundations 
of quantum optics. My interest here is not in quantum optics but in the details 
of those mathematical “higher developments,” as they relate oscillators and to 
the argument that lies at the heart of the She/Heffner paper. 

I expect to draw my material primarily from Chapters 2 & 3 of Christopher 
Gerry and Peter Knight’s Introductory Quantum Optics (2005) and Chapter 0 
of my own Advanced Quantum Topics (2000). 

Oscillator basics. The oscillator Hamiltonian reads 


H = o^(p 2 + mVq 2 ) 


We set to = 1 and introduce non-hermitian operators 

a =7sfe^ q+ip) | 

a+= 7^^ q - ip) J 


giving 

H = hu{ a + a + 1 1) 
[a, a+] = I 


(1) 


( 2 ) 


Let the dimensionless hermitian “number operator” N -the name of which 
makes natural sense only in quantum-field-theoretic or quantum optical contexts 


1 “Simultaneous measurement of noncommuting observables,” Phys. Rev. 
152 , 1103 (1966). 

2 “Coherent and incoherent states of the radiation field,” Phys. Rev. 131 , 
2766 (1963). 
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where one is working either explicitly or implicitly in Foch space—be defined 

N = a+a (3) 

It becomes more natural henceforth to speak of N than of H = hto{ N + 11). 
Suppose N|j/) = v\v). Then 

a+N 1 1 /) = a+(aa+ - l)|i/) = (N a + - a + )|i/) = va + \v) 

$ 

Na + |i/) = (v+ l)a + \u) 

and similarly N a W) = {v-l)a \u) 

Therefore N a p \v) = {y—p)a p \v) : p = 1,2,3,... But N is positive semi-definite: 
if |/3) = a |cc) then (a|N|a) = |||/?)|| 2 > 0 (all |a)). To truncate the descending 
series (v — p) : p = 1 , 2 , 3, ... we are forced to take v to be an integer and to 
posit the existence of a ground state (“vacuum” state) |0) such that 

a|0) = 0 (4) 

Building on that foundation, we have 3 

|1) = c 0 a+|0) 

|2)= Cl a + |l) 

|3) = c 2 a + |2) 

\n + 1) = c„a + |n) 


To evaluate the constants c„ (which can without loss of generality be assumed 
to be real) we proceed 


(n|a a + |n) = (n + l)(n|n) = n + 1 

= c“ 2 (n+ l|n+ 1) = c~ 2 


1 


It now follows that 


|n) 



1 

y/n(n-l) 


- 1 ) 

( a+ ) 2 | n 


2 ) 


= ^f( a+ )”|0) (5) 

For the purposes at hand these results are most conveniently written 
a|n) = g n \n - 1), a+|n) = g n +i\n + 1) with g n = \/n 
from which we recover N|n) = a + a|?r) = g n a + \n — 1) = g n g n \n) = n\n). If 

3 Here I borrow from my “Simultaneous measurement of noncommuting 
observables” (October 2012), page 5. 
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quantum oscillators were the objects of interest we could, by H = fuo (N + \ I), 
write 

H | n) = E n \ri) with E n = fuo (n + |) 
and easily recover 4 descriptions of the familiar oscillator eigenstates 

ipn(q) = (q\n) 

But our present interest lies elsewhere. 

Eigenstates of creation/annihilation operators: “coherent states”. We want to 
describe the solutions of 


a\a) = a\a) equivalently (a|a + = (a|ai 

where by the non-hermiticity of a we expect the eigenvalues a to be complex. 
The states |a) are, for reasons that will emerge, called “coherent states.” Writing 


= J 2 A n\n) 


n =0 

we have 

a|a) = ^2A n y/n\n - 1 ) = a^2,A n \n) 

giving A n ^/n. = aA n -i whence 


a 


A — A — 

Ai i—sin— 1 ,—7 - —r 

v™ \/ n(n — 1) 


A n —o — • • • — 


a 




Therefore 


|a) = A 0 ^2-^=\n) 

Vnl 

Normalization requires 

1 = («|a) = |^o| 2 EE ^=H«) = l^o| 2 E^T“ = l^>| 2 e 

^ ym!n! n\ 


a|2 " = l4J2> I 2 


and supplies 


A Q _ g-jM 2 . gt (arbitrary phase) 


Discarding the phase factors, we have 


I a) 


= e-iM 2 ]T 


n 



( 6 ) 


where a ranges freely on the complex plane. 

Let a and /3 mark two points on the complex plane. Immediately 


4 See Griffiths, §2.3.1 or Advanced Quantum Topics, Chapter 0, pages 40-41. 
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*) = 




EE 


m n 

. i i<ii 2_ i i„,i 2 x—v f3 n a 


y/m\n\ 


(m\n) 


- e -hm 2 -hW\ 2 


E ; 

n 

= e -^l/3| 2 -||a| 2 +^a 


ni 


Observing that 

-{|/3| 2 - ||a| 2 + /3a = —1/3/3 - ^aa + |/3a + |/3d 

+ ^/3a — |/3 a 

= —1|/3 — a| 2 + | (/9a — /3a) 

we have 

(/3|a) = i?(/3, a) e M0 ' a) 

where R(/3, a) = exp { — ^|/3 — a| 2 } and i<p(/3, a) = ^(/9a — /3d). So 
(/3|a) ± 0 

« 0 when |/3 — a| is large 


(7.1) 


(7.2) 


But while coherent states—though normalized—are not orthogonal, they 
are complete in the sense 


l«)—H = i 

7r 


dr ol — da r da 


( 8 ) 


as I demonstrate: 5 We have 


/ l“) V (a| = i EE 7=h{ /«- | " |! a’"o”d 2 o}| ra )( n | 

In polar coordinates a = re 16 and d 2 a = rdrdO so we have 

pOO p2lV pOO 

{etc.} = / e~ r2 r m+n+1 dr ■ / e i{n - m)e d9 = 2n6 mn r m+n+1 dr 

Jo Jo Jo 

giving 

/ 2 q roo 

\a) — (a| = E 3 / e -T ’ 2 r 2n+1 G?r|n)(?r| = ^ |n)(n| = I 
75 n n - •'o 

where we have used 2 J 0 °° e _r r 2n+1 dr = r(n + 1) = n!. While this establishes 


5 Here again, my source is Gerry & Knight’s §3.5. 
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completeness, the coherent states |a) are in fact “overcomplete.” For we have 

I/?)- / H^N/3) 

-1a) era = weighted superposition of |a) states (9) 

The “reproducing kernel” 


R(a, P) e 


K(a,P) 


R(a,P) 

7T 


i e -hwp-m 2 +«0 

7r 


stands where a delta function d(a — 0) would stand if the coherent satates were 
linearly independent. Notice that \p) itself contributes- but only fractionally— 
to the preceding representation of \P): K(P,P) = A “Overcomplete bases” are 
not commonly encountered, but are not intrinsically bizarre: look, for example, 
to the quartet of normalized 2-vectors 



which are seen to be complete in a sense 


E 


3 =1 


e 3 e 3 


0 

1 


that entails 

e k : E e i K j fc with K 3k = e fc) 
j 

The fractional contribution of e k itself to the sum enters with weight I\ kk = \ ■ 
The model fails, however, in one respect: the coherent states |a) arose as 
eigenstates of a linear operator a, but the vectors e k cannot be produced 
as eigenvectors of a matrix A; the matrices P j = are complete but 

are neither projective (PjPj = |Pj) nor orthogonal. Spectral decomposition 
methods are evidently not available when one is dealing with overcomplete 
bases. 

We cannot expect the transformation from the complete orthonormal basis 
{jn)} to/from the overcomplete non-orthogonal basis to be unitary (or even to 
be “square”). And indeed, 


J (m\a)^(a\n) = ( m\n) = S mn 
^(a|n)(n|/3) = (a|/3) = R(a,P) ^ <5(a — P) 

n 
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so while ( a\n ) possesses a left inverse, it does not possess a right inverse. For a 
finite-dimensional instance of such a situation consider the matrix 


U = 



which supplies 


U+U = 


1 0 
0 1 


\ 1 

0 

0 \ 

, UU+ = 0 

1 

0 

> lo 

0 

0 / 


Coherent representation of states and operators. Let by, denote the coordinates 
of \ip) in the number basis: 


IVO = \ n ^n with ljj n = ( n\lp ), = 1 

n n 

Passing to the coherent basis, we have 

W) = l f \a)d 2 a(a\ip) = \ f \a)d 2 a y 'ip n (a\n) 

n 

From the adjoint of (6) we obtain 


(a\ri) = e-5 |a|2 Y -ty=(m\n) = e“iW 2 ^ 

\fn\ 


giving 


IV’) 



n 



( 10 . 1 ) 

( 10 . 2 ) 


Here {etc.} is the a th coherent coordinate of \tp) and ip(z) is an element of the 
“Bargmann-Segal space” of entire functions. 6 

6 V. Bargmann, “On a Hilbert space of analytic functions and an associated 
integral transform,” Comm. Pure & Applied Math. 14, 187-214 (1961) and 
“Remarks on a Hilbert space of analytic functions,” PNAS 48, 199-203 (1962); 
I. Segal, “Mathematical characterization of the physical vacuum for a linear 
Bose-Einstein field,” Illinois J. Math. 6, 500-523 (1962). In “On the applications 
of Bargmann Hilbert spaces to dynamical problems,” Annals of Physics 41, 
205-229 (1967) S. Schweber illustrates the utility of Bargmann-Segal concepts 
in quantum contexts that have nothing to do with coherent radiation (but much 
to do with oscillators.) 
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Turning from states to operators.. .in the number basis a linear operator 
F -not presently assumed to be hermitian—acquires the representation 

F = ^ \m)F mn (n\ with F mn = (m|F|n), elements of F 

m,n 

In particular, we have 


/° 

0 

0 

0 -A 

0 

1 

0 

0 ••• 

0 

0 

2 

0 ••• 

0 

0 

0 

3 ••• 

\i 





and use a |n) = y/n \n — 1), a + |n) = \/n + 1 |n — 1) to obtain 



/° 

v 7 ! 

0 

0 



( °- 

0 

0 

0 



0 

0 

V2 

0 


71 

0 

0 

0 

A = 

0 

0 

0 

73 


, A + = 

0 

72 

0 

0 



0 

0 

0 

0 



0 

0 

73 

0 







■■■) 


V ; 




• J 


In the number representation the operators 


q = y/h/2(a + + a), p 

= i\Jh! 2(a + 

- a) 

therefore become 

( 0 

+71 

0 

0 

-\ 


71 

0 

+72 

0 

Q = VhJ 2 

0 

72 

0 

+73 


0 

0 

73 

0 



V ; 




• J 


( 0 

-71 

0 

0 



71 

0 

-72 

0 

¥ = i^h/2 

0 

72 

0 

-73 


0 

0 

73 

0 



V : : : : 

which are seen to be hermitian. The fundamental commutator [q. p] = ih I 
does not admit of finite-dimensional representation, since tr[Q,P] = 0 in all 
finite-dimensional cases, while tr [ih I] = ih oo; for the matrices described above 
we in (for example) the 5-dimensional case obtain 

[Q, P] = ih ■ diag{l, 1,1,1, —4} ^ ihl 


which is traceless. In the oo-dimensional case the final term is “pushed out of 
the picture” and (formally!) we recover [Q,P] = ihl. 
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Passing from the number basis to the coherent basis, we have 
F = ^ \m)F mn {n\ 

m,n 

= ^2 [["Yl \a)d 2 a(a\m)F mn (n\/3)d 2 (3(/3\ 

^ ^ m,n 

= ^Jj \a)d 2 a{e~^ 2+ W 2 ^(a,P)}d 2 m 


where 


J(d,/3) = ^F ? 


a m t3 n 


-Jm\n\ 

mn 

= e KM 2 +l/?l 2 ) (a | F | 


On the diagonal 




(a| F ja) = J(a, a) = V' F mn -^= 

tr v ^ 


r.lrj.l 


( 11 . 1 ) 


( 11 . 2 ) 


Since the |a)-basis is overcomplete, it is perhaps not surprising that the diagonal 
elements (a|F|a) are sufficient to determine all the matrix elements (to| F|n): 


mn \8a 


mu™ 


Look to the coherent description of the trace. We have 

trF =5Z( n |F| n ) = // (n\a)d 2 a(a\F\P)d 2 P(P\n) 

n n 

= ^2 ff ^2d 2 a{a\F\f3)d 2 f3{f3\n)(n\a) 

n 

= ^JJ d 2 a(a\F\p)d 2 p(p\a) 

But the overcompleteness relation reads 

i J \(3)d 2 (3((3\a) = |a) 

so we have 

trF = L f (a|F| a)d 2 a (12) 


which is—satisfyingly—quite as expected. 
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Evaluation of (a\ F /3) becomes trivial when F is presented as an 
a + a -ordered function of a + and a 


F(a + , a) 


J a 


for then 


(a | F |/3) = F(a, (3) ■ (a\/3) = F(a,/3)- e“2 |a| -jlfl + 
Look, for example, to the unitary “displacement operator” 7 

D(w) = exp {u a + — u a } 


(13) 

(14) 


When A and B commute with their commutator C = [ A, B ] we have the 
well-known Kermack-McCrae identity 8 


A + B 


e 2 c . g a gB 

g+2 ^ 


AB-ordered 
BA-ordered 


(15) 


From [wa + , — tia] = — |w| 2 [a + , a] = +\u\ 2 I we are led therefore to 
D(w) = e _ sM exp {wa + } exp {— rta} 

(a| D(w)|/3) = e -iM 2 +«a-«/3 

which provides a concrete instance of (13). More to the point and more 
interestingly: it is clear that 

OO 1 

a |0) = 0 =► exp{-ua}|0) = ^(-) fe -w 2 a fc |0)=a°|0) = |0) 


so we have 


k= 0 


s~^A 2 exp {wa + } 0) 


OO 

n=0 


OO 

n=0 V« ! 

by (5) 

u) 

by (6) 


(16.1) 

at which point it becomes clear why D(u) operators are called “displacement 
operators.” This result serves to establish the sense in which 


|a) = D(a)|0) is a “displaced vacuum state” 


(16.2) 


7 Roy J. Glauber, “Coherent and incoherent states of the radiation field,” 
Phys. Rev. 131, 2766-2788 (1963), §111. 

8 Advanced Quantum Topics (2000), Chapter 0, page 32. 
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Products of unitaries are unitary; looking to the product D(tt)D(v), we 

have 

D(?i) D(u) = exp {ua + — u a } exp {ua + — Da } 

Appealing again to (15) with C = [ua + — ua,va + — Da] = (itD — uv) I we find 

D(u)D(v) = e §(“®-™). D(u + u) (17) 

Since the multiplier e h( uv ~ uv ) is of the form e 1 ^ it is physically inconsequential, 
but (f> admits nevertheless of an interesting geometrical interpretation: if 
u — re 111 and v = se la then <j> = rs sin(p — a). Notice also that 

D + («) = D-’(u) = D(-u) (18) 


Minimum uncertainty states. The Schrodinger inequality 9 

(AA) 2 (AB) 2 ^ ( AB ^ BA ) 2 + [( AB y BA ) - (A)(B)J 2 (19) 

becomes 

in the most familar instance. Minimal uncertainty AxAp = f H is achieved 
if and only if [etc.] = “quantum correlation coefficient” 10 = 0. Familiarly, the 
underlying Cauchy-Schwarz inequality (f\f)(g\g) ^ I(/|5)| 2 becomes equality 
if and only if either |/) else |g) = 0 or | g) = \\/). In the present application 
|/) = 0 becomes A \ifj) = (A)]^), so | ip) is an eigenstate of A and (19) is reduced 
to a triviality: A A = 0 (else AB = 0). More interesting are the implications 11 
of |g) = A|/), which now reads 

(B - b)|V0 = A(A - a)|^) with a = <A)I, b = (B) I (20) 

Multiplication by (A — a) else (B — b) gives 

9 See “Simultaneous measurement of noncommuting quantum observables” 
(October 2012), page 3. 

10 In classical statistics, if x and y are random variables then one has (for all 
to and n) 

(x m y n ) = (x m )(y n ) iff x ane y are statistically independent 

The number (xy) — {x)(y) provides therefore a leading indicator of the extent to 
which x and y are statistically dependent or correlated. Generally AB 7 ^ BA. 
On the right side of (19) we are told to “split the difference.” See page 202 in 
D. Bohm, Quantum Mechanics (1951). 

11 See E. Merzbacher, Quantum Mechanics (2 nd edition, 1970), page 160. 
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A(AA) 2 = ((A - a)(B - b)> = (AB) - (A)(B) 
A' 1 (A Bf = <(B - b)(A - a)) = (BA) - (A)(B) 
which when added/subtracted give 


\{AA) 2 + X\AB) 2 
\(AA) 2 - \~\AB) 2 


2 

i( C 


AB + B A 


(A)(B) 


want = 0 (21.1) 

(21.2) 


where the hermitian operator C = |(AB — BA). The first of those equations 
supplies A 2 = —(AB/AA) 2 so A is necessarily imaginary: A = ±i(AB/AA). 
Returning with this information to (21.2) we obtain 


(AA A_B) min = ±1(C) : discard the absurd sign 


Reverting again to the case A = x, B = p we recover (Ax Ap) min = I n 
this instance (20) reads A^(p — p 0 \)\ip) = (x — xo\)\ip) which can be written 


(x - A' 1 p)\ip) = (x 0 - A Vo)|VO 


where xo = (x), po = (p), A 1 = —iAx/Ap. In the x-representation 

(*d x - po)i/>(x) = A(x-x 0 )V>(x) : A = z^| = i 2 [Ax) 2 


d x ip(x) = 


X — Xp 

2 (Ax) 2 

of which the normalized solution is 


ipox/h ip{ x ) 


^ exp {~ (la?)’ + inxlK ) (22 ' 1) 


Such “launched Gaussian” states were first encountered by Schrodinger in the 
paper 12 in which he reported the Schrodinger inequality (19), and in my own 
writing were most recently encountered 13 as states produced by Arthurs-Kelly 
measurements when the detectors read {xo,po}- At xo = po = 0 we have 

v ’“ w = (^(i^)‘ exi> {^(^) 2 } (22 - 2) 

= Vcentered normal distribution with standard deviation Ax 


12 “Zur Heisenbergschen Unsharfeprinzip,” Berliner Berichte, 296-303 (1930) 

13 See equation (12.1) on page 17 of 9 . In such measurements the value of Ax 
(as also of Ap = lfi/Ax) is set by the “balanced Gaussian” initial states of the 
detectors. 
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which becomes a description of the oscillator ground state when Ax is assigned 
a value set by the system parameters: Ax = i/2 rnco/H. 

More abstractly, we have ^(x) = (cc|0) where (0|0) = 1 and (x+*/3p)|0) =0. 
The latter equation can be written in a variety of ways, of which for our purposes 
the most interesting is a 10 ) = 0 where a is the dimensionless non-hermitian 
operator defined 


a 


Ax A p 


2 h 


x 

Ax 



= ^#" 1x+ ^ p ) 

£ = v/ Ax/A p 


(23) 


Here again, [x, p] = ih\ => [a, a + ] = I so we have been led back -this time 
not from oscillator theory but from Schrodinger’s identity—to the operators 
{a, a + } that support what is called the theory of coherent states but on the 
basis of the preceding argument might more reasonably be called the theory of 
minimal uncertainty states. 

The eigenstates a (which I will for the purposes of this discussion denote 
| u) because the symbol a will have other work to do ) can be obtained by 
“displacement” of the vacuum/ground state |0), as was remarked at (16). In 
the x-representation we have 


V’m(x) = (x|D(u)|0) 


where 

D(it) = exp { ua + — u a } 

= ex p { x - *£p) - x + ^p) } 

I 

= exp {|(a p + /3 x) } if we write u = ^=(— £“'a + ^P) 

= ex P {§!«/?}• ex P {|/? x } ex P 

supplies 

ip u (x) = exp • exp {|/3x} exp {ad x }il> 0 (x) 

= exp {| j[ a P} • exp { j:f3x}i/jo(x + a) 

= «p • (2^55)“ exp {“ (laf /+ il3xlh } 

If we discard the physically irrelevant phase factor exp{P|a/3} and set 
a = — X’o, P = po —which is in effect to set 

u = </2h “P *?Po) 

(and could, in view of (23), not be more natural)--we recover precisely (22.1). 
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Some additional means and uncertainties. The vacuum state |0) —which by 
one interpretation became the oscillator ground state, and was defined by an 
equation a|0) = 0 that implies N |0) = 0 —arose from the requirement that 

Ax A p = minimum 

Specifically, we found (0|x|0) = (0|p|0) =0 which gave 
AxAp = (0|x 2 |0)-(0jp 2 |0) = \n 

I look now to the evaluation of (n|x 2 |n)-(n| p 2 |n). From (23) we obtain 

X = \/h/2£, (a+ + a) 
p = iyfh/2 £ _1 (a + - a) 

Drawing upon a \n) = \fn \n — 1), a + | n) = \/n+ 1 \n + 1) and the orthogonality 
of the |n)-states we have (n|x|n) = (n|p|n) = 0, so 

A n x = \J(n\x. 2 \n), A n p = \/(n|p 2 |n) 

where 

(n| x 2 | n) = (fi/2)£ 2 (n|a + a+ + a + a + a a+ + a a | n) 

But (n|a + a + |?z) = (n|aa|n) = 0 by orthogonality, while 

(?j,|a + a + a a + |?r) = (n| N + 11 n) = n + 1 

so we have 

(rc|x 2 |n) = (fi/2)£ 2 (n + 1) 

Similarly (n|p 2 |n) = (fi/2)£ -2 (n+ 1) 

so 

A n x A n p = ^ h (24) 

which gives back the familiar result at n = 0. A simple adjustment | n) —> |a) 
of the preceding argument supplies 

(a| x |a) = \J fi/2 £ (a + a) 

(a|p|a) = \fh]2 £ _1 (d - a) 

(a| x 2 |a) = (fi/2)£ 2 [(a + a) 2 + 1] 

(a|p 2 |a) = (fi/2)£ _2 [(d — a) 2 + 1] 

If the system is in state |a) then 

(n) = (a| N | a) = (a|a + a |a) = da = |a| 2 (25) 


(n\a) = e ^ ^ -j=(n\m) = e l\ a \ 2 ~^= by (6) 
v m! V nl 


|(?i|a)| = e 


a- 

2 = p -|a| 2 dV = e -|a| 2 hf! = c -(n) (») ? 


nl 


nl 


nl 


= p(n\ (n)) (26) 
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Obviously p(n; (n)) = 1. The discrete distribution p(n; (n)) is precisely 
the Poisson distribution with parameter (n). Evaluation of the higher moments 

OO 

m„ = ^^n !y p(n) = (a|N' y |a) 

fe=o 

poses an interesting operator-ordering problem, and leads to a result that can 
be described 

m v = T„((n)) 

where T u {x) = YHk=i^iy^) xk is a “Torchard polynomial ” 14 and S{v,k) is a 
“Stirling number of the second kind.” Thus 


m 0 = 1 

mi = ( n ) 

m 2 = (n) + (n) 2 

m 3 = (n) + 3 (n) 2 + (n) 3 

ni 4 = (n) + 7(n) 2 + 6(n) 3 + (n) 4 

= (n) + 15(n) 2 + 25 (n) 3 + 10(n) 4 + (n) 5 

which at (n) = 1 become “Bell numbers.” 15 In particular, we have 

m 2 = (a|N 2 |a) = (a|a + aa + a|a) 

= (a|a + a + aa + a + a|a) 

= (da) 2 + (da) = (n) 2 + (n) 


in agreement with a result just stated. These results supply 



variance = mean 


14 Such polynomials are sometimes called “Bell polynomials,” and are 
produced in Mathematica by the command BellB[^,x] once one has installed 
the Combinatorica package. 

15 The Bell number B„ (produced by the command BellB[>]) describes the 
number of distinct partitions of a ^-element set. Look, for example, to the 
3-element set {a, &, c}, of which the distinct partitions 

{(«),(fo),(c)} 

{(a),( 6 ,c)} 

{( 6 ),(a,c)} 

{(c), (a, 6 )} 

{( a,b , c)} 


are = 5 in number 
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so the fractional uncertainty becomes 


An 

(n) 




which decreases as the mean (n) = \a\ 2 increases. Recall in this connection 
that (by the central limit theorem) as the mean becomes large the Poisson 
distribution goes over to the Gaussian (or normal) distribution with that same 
mean = variance: 


p(n; m ) 


V2 


7TTO 


: exp 


r (n — to) 2 'i 

I 2m J 


set m = (n) 


Harmonic motion of coherent wavepackets. Look to the hermitian operator 
defined 

H=MN + i|) : N = a+a 

where a is given by (23). Some quick algebra supplies 

h = i w (? 2 p 2 + r 2 x 2 ) 

= 2 ^(P 2 + m 2 oj 2 q 2 ) if we set £ 2 = Ax/Ap = ^ 

H osc 

I once had occasion 16 to indicate how quantum dynamics could be formulated 
as a “theory of interactive moments,” and by way of introduction looked to the 
oscillatory case. I digress to present a brief recapitulation of that discussion. 
Working in the Heisenberg picture, we look to the motion of (p) = (^>|p|^), 
where | ip) is arbitrary: 

|<p) = i([p,H])=-m W 2 <x> 

at which point we have acquired an interest in the motion of (x), so write 

|(x) = i([x, H]) = i(p) 

Those coupled first-order equations entail 

5 F?(x) t + W 2 (x) t = 0 =4- <x) t = (x) max cos(wt + 5) 

Therefore ^(p) = — rnw 2 (x) max cos(wf + S) which gives 

(p)t = (P>max sin M + < 5 ) with (p)max = TOW (x) ma x 

We have been brought thus by the simplest of means to the striking general 
conclusion that for all states \tp) the expectation values (x) t and (p) t oscillate 
with angular frequency to (90° out of phase, with interrelated amplitudes), just 
as do the classical variables {x(t),p(t)}. If one looks similarly to the motion 
of (p 2 ) one acquires an interest in the motion of (C) with C = |(xp + px), 


16 Advanced Quantum Topics (2000), Chapter2, pages 51-60. 
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whence of ( D ) with D = ^ p 2 — mur x 2 and finds that 

|<C) = ^([C,H]) = (D) 

3t(D) = i^([D, H]} = —4 w 2 (C) 

We infer that (C) t and (D) t both satisfy equations of the form 

at? (A) t + 4u> 2 (A) t = 0 

and therefore oscillate with doubled frequency: 


(C) t = (7sin(2a>t + <5) 

(D) t = 2wC cos(2wt + 5) 

Integration of 

®<P 2 >=ik<[P 2 ,H]> = -2mu, 2 <C> 

^(x 2 )=i([x 2 ,H]) = i(C) 

now gives 

(p 2 ) t = P 2 — mwCsin(2w< + <5) 

(x 2 ) t = X 2 + jj^jC sin(2wi + <5) 

where P 2 and X 2 are constants of integration that by energy conservation are 
constrained to satisfy 


l 

2m 


(pV 


1 2 

2 mu; < 


x 2 ) t 


= ^P 2 + ima; 2 X 2 = P 


In work previously cited 16 I discuss how—in principle—such arguments can be 
carried out with Hamiltonians of arbitrary design. 

The point to which I draw particular attention is that while H osc causes 
moments of all orders to oscillate with what might be called “classical rigidity,” 
it is (in the Schrodinger picture) typically not the case that H osc induces 
wavepackets to move rigidly: 


ip(x,t) ^ rigid translate of ip{x, 0) 


Generally 

IVOt = u(i)IVOo with U (t) = exp {- | H osc f} 

= e ~ l ^ ut • exp {- iut.U } 


Clearly 

|„) t = e -<("+§M|„) 0 (27) 

so the |n)-states (energy eigenstates) simply “sit there and buzz” with their 
respective frequencies. Writing %j) n (x,t) = (x\n) t we have 


1pn{x,t) = e »("+2 (£,()) => \lp n {x,t)\ 2 = \lp n {x,0)\ 2 


The probability density \ip n (x,t)\ 2 does in fact not move, but famously mimics 
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the positional distribution of a classical oscillator with energy E n = (n+ ^)Hco. 17 

Coherent states were seen at (6) to be linear combinations of number states 
(i.e., of energy eigenstates, each buzzing at its own frequency) and possess 
therefore the character of wavepackets. As it happens, the buzzing components 
conspire to produce a net motion which is in this instance quite distinctive. We 
have 

| a) t = • exp {— icotN }|a) 0 

From N = a + a and [a, a + ] = I we get aN fc = (l\l + l) fe a : k = 0,1, 2,3,... whence 


a - e uN = e u e uN a 


which when applied to |a) gives 

a ■ e uN |a) = ae u ■ e uN |a) =► e uN |a) = |ae“) 

We therefore have 

\a) t = e~^ ut ■ \ae~ iuJt ) (28) 

The prefactor e~ l ^ ut is unphysical; ae~ lUJt traces with angular velocity co a 
circle of radius a) on the complex a-plane. 

We established on page 12 that if we write a = + *£po) then in 

x -representation 

tp a {x) = ( x\a ) = (a;| D(a)|0) 

= exp{-!|x oPo } (^ Ax)2/ 

Since this is a minimal-uncertainty state, we have Ax Ap = fi/2 whence 


2) 4exp {~l( :i A?) +i Po x / n } ( 29 ) 


c 2 = At — ^(^ x ) 2 
s A p H 


Ax = s/hj2t 


But we saw on page 15 that when we import oscillator parameters we have 
£ = 1/y/mco, so 

Ax = i Jhj2 mco 

The evolved oscillator coherent state (displaced ground state) is described 

a(x,t ) = 1p a (t)( x ) 


where 

a(t) = -A=(^“'a;o + it;po){coscot — i sin cot) 

= {(^ _1 aa 0 cos cot, + £p 0 sinwt) + i(£p 0 cos cot — t;~'x 0 sin cot)} 

= J^{C{x 0 cos cot + ePo sinwf) + it; (po cos cot — t;~ 2 xo sin cot)} 

= ^{Z~' x (t) + i£p(t)} 


17 See FIGURE 2.7(b) on page 58 of David Griffiths’ Introduction to Quantum 
Mechanics (2 nd edition, 2005). 
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Calculation establishes that if £ 2 = then 

2 kP 2 (t) + \mu 2 x 2 {t) = j+ \mup-xl = E 

so the moving point {x(t),p(t)} traces an isoenergetic ellipse in classical phase 
space, while 

ip a {x,t) = exp { — \\x{t)p(t)} 

^2i^) ,eXP {-j( ?L AV ?? ) 2 + i! ’ (,)a:/S } <301) 

= (30 - 2) 

is a Gaussian which sloshes rigidly back and forth with period t = 2tt/u> and 

amplitude = \J2E/mu : 2 

Equations (29) describe the motion of a “launched oscillator ground state.” 

Wigner representation of coherent states. With nothing more than a passing 
allusion to “work done some years ago by Leo Szilard and the present author 
for another purpose,” E. P. Wigner 18 was content simply to pluck from his hat 
the construction 


P^(x,p) = lj tp(x + 0e 2 * p<: il)(x - ()d( 


(31) 


that in the hands of J. H. Groenwald 19 and especially of J. E. Moyal 20 was to 
give rise to the “phase space formulation of quantum mechanics,” to which a 
fairly comprehensive introduction can be found in Chapter 2 of my Advanced 
Quantum Topics (2000). It emerged that if the quantum observable A stands 
in “Weyl correspondence” with the classical observable 


A( x , v) =jj»(a,P)e 


A(x,p) 


Weyl 


A = / / a(a, (3) e^ Qp+f,x ^ dad/3 


and if P(x,p) is the inverse Weyl transform of the density matrix p, then 

18 “On the quantum correction for thermodynamic equilibrium,” Phys. Rev. 
40, 749 (1932). 

19 “On the principles of elementary quantum mechanics,” Physica 12, 405 
(1946). 

20 “Quantum mechanics as a statistical theory,” Proc. Camb. Phil. Soc. 45, 
92 (1949). 
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quantum expectation values assume formally the structure of the expectation 
values encountered in classical statistical mechanics: 


A ) p = tr A P = A(x, p)P{ x, p) dxdp 


In particular, one has 


trp = 1 = // P(x,p)dxdp 


as one would expect if P(x,p) referred to a “distribution on phase space.” But 
—and here quantum theory asserts its exceptionalism—Wigner distributions 
can (and typically do) assume negative values. At no point does this 
circumstance compromise their quantum mechanical utitility, but in its light 
they are called “quasi-distributions.” 21 It was known to Wigner, but first 
reported (without proof) by Takabayasi, 22 that P(x,p) is bounded above and 
below by a universal constant that becomes infinite in the classical limit: 

I P(x,p)\ < \ 

This in conjunction with f P(x,p) dxdp = 1 means that P(x,p) cannot have 
a “phase space footprint” with area less than which casts the Heisenberg 
uncertainty principle in interesting new light: a rectangle of that minimal area 
would have sides Ax and A p that satisfy AxAp = 

The phase space formulation of quantum mechanics is a lovely subject 
which, unfortunately, has been (at least until recently) largely ignored by the 
authors of quantum textbooks. 23 The subject has, however, acquired the status 
of an important tool in the hands particularly of physicists concerned with 
quantum optics and its application to foundational problems. It is, therefore, 
not surprising that the subject is touched upon in §3.7 of Garry & Knight. 5 

When the density operator refers to a pure state p = \ip)(ij)\ we recover 
precisely the Wigner distribution (31). If, in particular, we insert the coherent 
state (29) into (31) we (use Mathematical s Fourier Transform command) 

21 Feynman—among many others (including John Bell)—admitted to a 
fascination with the “negative probability” idea; for references, see pages 62-64 
and 71-72 in Chapter 2 of Advanced Quantum, Topics (2000). 

22 T. Takabayasi, “The formulation of quantum mechanics in terms of 
ensembles in phase space,” Prog. Theo. Phys. 11, 341 (1954). See especially 
§7 in that important paper. 

23 There is, for example, no mention of the subject in Griffiths or Mertzbacher, 
though Asher Peres, in Chapter 10 (“Semiclassical Methods”) of his Quantum 
Theory : Concepts and Methods (1993) does in his §10-4 provide a brief (and 
not-very-helpful) account of its bare essentials. 
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obtain 



with A p = yd/ Ax. The associated marginal distributions are 

X a (x) = j P a (x,p)dp= } (32.1) 

■? a (p) = Jp a (x,p)dx= 7 =L Xp ex p{- |(^) 2 } (32.2) 


Here as before, a = 'cco + i£,Po) where Xo, po and £ = Ax/A p serve 

as control parameters. Note that the Wigner functions of coherent states are, 
according to (32)—quite uncharacteristically of Wigner functions in general— 
everywhere non-negative. 


Set xq = Po = 0 and £ = 1/ \Jmoj ; then (32) becomes a description 
of the oscillator ground state. Displaced ground states (relax the conditions 
x o — po — 0) are perfectly good oscillator states which H osc sets into elliptical 
motion: 


P a (x,p,t) 



/_ i/ s-s(*)\ 2 _ l ( p-p{t) \ 2 \ 
\ 2 V Ax ) 2 V A P ) J 


In ^-representation the energy eigenstates of an oscillator (eigenstates | n) of 
the number operator) can in terms of the variables 


be described 


x = \J 2 mu / h ■ x 
p = \/2/mujH ■ p 

ipo(x) = 


dimensionless length 
dimensionless momentum 


ipi{x) = 
ip 2 (x) = 


\J a\f 2lr 
v ay/2-K 


e -^ 2 






with a = yjh/2rm/j, and give rise to Wigner distributions that can be written 


P 0 {x,p) = + |e 2 £ 

Pi{x,p) = -|e"5 £ (l - £) 

P 2 (x,p) = + |e-5 e (l-2£+i£ 2 ) 


(33) 


P n (x,p) = (-rle-i E L n (E) ) 

Here £ = x 2 + p 2 is “dimensionless energy,” interpretable as the squared radius 
of a circle inscribed on the {x, p}-coordinatecl phase plane, and the functions 
.£„(•) are Laguerre polynomials. At n = 0 we recover the Wigner function of 
the oscillator ground state. In all excited cases ( n > 0) the oscillator Wigner 
functions exhibit annular regions of negativity. In particular, P o dd(0,0) < 0. 
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In Mathematical, 7 use variants of the following command 

Plot3D[(-l) 3 Exp[-| (x 2 +p 2 )] LaguarreL[3,x 2 +p 2 ] ,{x,-6,6}, 
{p,-6,6}, PlotRange—>{-l, 1}, PlotPoints—>50, 

Mesh—> False, Boxed—>False , Axes—>False] 

to see what such functions look like. 


At this point (4:27pm, 11 December 2012) I received word that Richard 
Crandall has been admitted to the ICU at OHSU suffering from acute 
leukemia. Chemotherapy is under way, is expected to continue for 
some time, and he reportedly “feels like shit.” 

In 1940, Kodi Husimi (1909-2008)—then a young physicist at the 
University of Osaka—published a searching account of the theory of density 
operators. 24 Though somewhat tangential to Husimi’s main interests (nuclear 
physics, plasma physics, statistical mechanics, origami... and later: the control 
of nuclear weapons), the paper demonstrated an exceptionally secure command 
of quantum developments during the 1930s, but—oddly—it appears that Husimi 
had never heard of Wigner distributions. The theory of which he proceeded to 
reinvent... and to extend: Husimi devised a pretty way to “temper” Wigner’s 
quasi-distributions so that they become every where-non-negative proper 
distributions. Husimi’s idea is easy to describe, though I must direct my reader 
elsewhere 25 for a detailed demonstration that it works as claimed: he introduces 
a “smear function” which is none other than the Wigner function (32) of a 
coherent state 


G(x - x 0 ,p-po) 



1 [ P-Po l 2 ) 

2 L A J / 


(where er and A are subject to the “minimality condition” erA = \h) and proceeds 
P^(x,p) - > P^{x,p) = h G{x - x',p - p')P^(x',p')dx'dp' 

Husimi J J 

where a “poor man’s bold” symbol has been used to suggest that P^, is a 
“smeared” companion of P^. It can be shown without much difficulty that 
all Husimi functions are in fact everywhere-non-negative and normalized. But 
while Wigner functions satisfy 



P^,(x,p)dxdp = 1 


all | ip) 


as an expression of the pure-state condition trp 2 = 1, one finds that Husimi 

24 “Some formal properties of the density matrix,” Prog. Phys. Math. Soc. 
22, 264-314 (1940). The paper was published in English, but in a journal that 
was not widely available in the West until after the war; it took many years for 
its value to be appreciated. 

25 See once again Advanced Quantum Topics (2000), Chapter 2, pages 36-50. 
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smearing produces a result known to be characteristic of mixtures: 

hJJ P^(x,p) dxdp < 1 : all | i/j) 

The Husimi transforms of the oscillator energy eigen-distributions (33) are 
found to read 

P 0 (x,p) = i !e~i £ 


2 h c 

Pi{x,p) = g |e"3 £ . £ 
P'2{X,P) = ’ £ ' 


Pn(x,p) = ^ \e~^-r 

which are manifestly non-negative. One verifies that 


but discovers that 


II P n (x,p)dxdp= 1 : all 


h Pif(x.p) dxdp = \ < 1 


7l ^(x,p)dxdp = | <1 


h 11 Pi(x,p ) dxdp = ^ < 1 


(34) 


Historically (and still today), some people have looked upon the fact that 
Wigner distributions display regions of negativity as a fatal defect—reason 
enough to abandon the entire Wigner/Moyal formalism. Certainly those regions 
of negativity pose a problem if one proposes to use 


entropy 




to assign an entropy to quantum states and mixtures, for it leads to a “complex 
entropy,” a notion with which I have wrestled a bit elsewhere. Others have 
considered it to be evidence that the formalism stands in need of surgery. 
Husimi’s procedure (and others have been proposed) can be viewed in this 
light. 26 But what I myself find most striking about Husimi’s procedure is the 
novel use it makes of (the entire population of) coherent states. 

26 So little known did Husimi’s work for so long remain that after a lapse 
of 36 years Nancy Cartwright—a philosopher of science then at Stanford —felt 
called upon to reinvent it: “A non-negative Wigner-type distribution,” Physica 
83A, 210 (1976). Of course, neither Husimi nor Cartwright made mention of 
coherent states. 
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Yet another route to the coherent state concept. We looked first to the 
eigenstates of the operator a that arises when the oscillator Hamiltonian is 
factored, and were led to the overcomplete population of “coherent states” | a). 27 
When we looked to the functions that minimize the expression on the right side 
of Schrodinger’s inequality we (assuming that the observables in question are— 
like x and p —conjugate) were led back again to that same population of states, 
which acquired at this point the significance of “minimal uncertainty states,” 
states that kill the quantum correlation coefficient. We encountered those states 
(actually the Wigner representations of those states) once again as the “Husimi 
smear functions” that remove regions of negativity from Wigner distributions. 
I turn now to review of yet another line of argument from which coherent states 
emerge as natural objects. 28 

In 1965, two relatively obscure Bell Labs engineers who were interested in 
“determining the limitations of coherent quantum mechanical amplifiers, etc.” 
published a 5-page paper 29 that for nearly thirty years attracted very little 
attention. 30 It did, however, engage the immediate attention of C. Y. She and 
H. Heffner, two applied physicists at Stanford, who within seven months had 
submitted a paper that took Arthurs & Kelly as its point of departure. 31 

Arthurs & Kelly, elaborating on a dynamical measurement model proposed 

27 Supersymmetric quantum mechanics exploits the analogous material that 
emerges whenever a positive semi-definite Hamiltonian can be written as a 
Wieshart product Hi = W + W. In that theory the properties of H i are studied 
in conjunction with those of its companion H 2 = WW + . See Christopher 
Lee, “Supersymmetric quantum mechanics,” (Reed College Thesis, 1999), which 
provides a good introductory review and an elaborate bibliography. See also 
pages 4-7 of my “Simultaneous measurement of noncommuting quantum 
observables,” (October 2012). “Factorization methods” were pioneered by Dirac 
(1935) and Schrodinger (1940) and further developed in a famous paper by 
Leopold Infeld & T. E. Hull, “The factorization method,” Rev. Mod. Phys. 
23, 21-68 (1951). A Google search reveals that the literature has assumed vast 
proportions. An interesting recent contribution is J. Oscar Rosas-Ortez, “On 
the factorization method in quantum mechanics,” which is available on the web 
at arXiv:quant-ph/9812003v2 14 Oct 1999. 

28 It was this last line of argument that originally motivated this entire 
discussion. 

29 E. Arthurs & J. L. Kelly, Jr., “On the simultaneous measurement of a pair 
of conjugate observables,” Bell System Technical Journal 44, 725-629 (1965). 
The paper actually appeared in an appendage to that journal calledBSTJ Briefs. 

30 The writing is densely obscure, and the authors claim a result that most 
physicists were prepared to find dubiously perplexing. Besides, it was not the 
habit of most physicists to scan the BSTJ, even though it was the journal in 
which Claude Shannon and his distinguished Bell colleagues usually published. 

31 C. Y. She & H. Heffner, “Simultaneous measurement of noncommuting 
observables,” Phys. Rev. 152,1103-1110 (1966). Heffner maintained a working 
relationship with W. H. Louisell at Bell Labs. 
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by John von Neumann in the final pages of his Mathematical Foundations of 
Quantum Mechanics (1932), imagined a two-detector device that upon 
completion of its interaction with a system in initial state if{x) announces 
{cco,Po}i signaling (as they show) that it has accomplished 

JL 2 

V’before(z) -♦ Rafter(*) = )' eX P { “ \ (^r) + ^ X / H } 

where the value of Ax is set by the specialized initial states of the detectors. 
The point of immediate relevance (see again (29)) is that as a byproduct of its 
activity the Arthurs/Kelly device prepares coherent states. 

She & Heffner—who were familiar not only with quantum optics (then 
in its infancy) but also with the Bayesian information-theoretic approach to 
physics (and to science generally) propounded by E. T. Jaynes 32 —proposed to 
recover essential aspects of Arthurs/Kelly’s result from an appeal to a maximal 
entropy principle. I turn a review of their argument, which they relegate to an 
appendix. 

With She & Heffner, we seek the positive definite self-adjoint operator p 


that maximizes the von Neumann entropy 

S = -tr(plogp) 
subject to these five constraints 

trp = 1 (?) 

tr(xp) = Xq ( ii ) 

tr(pp) = Po {Hi) 

tr (x 2 p) = Xq + {Ax) 2 (iv) 

tr(p 2 p) = Po + {Ap ) 2 (v) 


of which the first completes the list of requirements imposed upon density 
operators. We suppose Xq and p o to be freely specifiable, but anticipate that Aa; 
and Ap will ultimately become subject to the familiar constraint Ax Ap ^ 
Evidently p can be written 

p = exp | Ao I + Ai x + A2 x 2 + A3 p + A4 p 2 } 
where the As are Lagrange multipliers. To evaluate the traces we might bring 
| Ao I + Ai x + A2 x 2 + A3 p + A4 p“} exp | Ao I -I- Ai x -I- A2 x 2 + A3 p + A4 p“} 
to xp-ordered form and make use of the “mixed representation trick.” 33 But 

32 They cite E. T. Jaynes, “Information theory and statistical mechanics,” 
Phys. Rev. 106, 620-630 (1957), but more relevant is “Information theory and 
statisticalmechanics.il,” Phys. Rev. 108, 171-190 (1957). Jaynes, by the way, 
had been a student of Wigner. 

33 See Advanced Quantum Topics, Chapter 0, page 38. A relevant ordering 
identity (“McCoy’s theorem”) appears on page 34. See also page 41. 
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that program is somewhat easier to execute if one expresses all {x, p}-operators 
in terms of the {a, a + }-operators introduced at (23). Writing 34 

log p = Mo - Mi( a+ ~ «)( a - a) 

= (Mo + aa - g 2 h) - Mi{.9 2 £ -2 x 2 + M 2 ? 2 P 2 

- (s + fljsf 1 * + i(a-d)g£ p} 

= Mo — Mi A + A with A = a — a I 

we could read off descriptions of {Ao, Ai, A2, A3, A4}, the role of which has been 
taken over by {mo, Mi> 3?(a), 3(a)}. We now have 

p = e^° exp { — mi A + A } 

which by [A, A + ] = I and a corollary e“ AB = exp | ^ ^ -A:B| of McCoy’s 

theorem 33 assumes the ordered form 

= e 4 * 0 exp { — (1 — e _/il )A + : A} 

where the colon signifies that all A + s are to be placed left of all As, as in the 
following example: e A:B = (this notational convention is due to 

Schwinger). So we have 


tr P = yj ( a\p\a)d 2 a 

= e Mo JJ exp {(e - ^ 1 — 1) (a — a) (a — a )} d 2 a 
= e Mo (l — e _Ml ) _1 • \ f [ d 2 (3 with /3 = \/l — e -M i (a — a) 


= e^(l-e _Ali r 

which to achieve tr p = 1 enforces 


e A ° = 1 - e~^ = e (35.1) 

The density operator has at this point assumed the form the form 

p = e exp { — e A + : A } 


Constraints (it) and (in) can be consolidated, to read 
tr(ap) = g(C 1 x 0 + i^p 0 ) 


34 As a notational convenience I have dropped all I operators and adopted 
the abbreviation g = 
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Arguing as before, we have 


tr(pa) = \ (a\pa \a)d 2 a 


= -P / / a ' £ ex P { — £ (a — a) (a — a)} d 2 


\ff (« + £ M) 


ee 


-00 d 2 0 


because Iff exp(— \(5\ 2 )d 2 (3 = 1 while ff /? exp(—1/3| 2 ) cZ 2 /3 vahishes by a parity 
argument. So we have 

a= + ( 35 - 2 ) 

It remains to work out the implications of (iv) and (v). To that end we observe 
that 

A = a -a = p(£ _1 (x -x 0 ) + i£(p - p 0 )) 

A+ = a+ - a = s(£ _1 (x - x 0 ) - i£( p - p 0 )) 

give 

x-x 0 = (A++A) 

P -Po = i\g~ 1 C l {k + - A) 

SO 

(x — xq) 2 = \g~ 2 ? (A+A+ + 2AA++ AA - I) 

(P -Po) 2 = -y~ 2 C 2 ( A+A+ -2AA+ + AA + I) 
and therefore (use \g~ 2 = and exploit trAB=trBA) 

(Ax) 2 = tr[(x - x 0 ) 2 p] 


= i/r^ 2 tr|A+A+p + 2A+pA +pAA - pj 
= |fi£ 2 ' nJJ («|{etc.}|a)d 2 a 


= |[(d-a) + (a-a)] 2 -l| 


• £ exp {— e(a — a) (a 

— a)}d 2 a 

= -.//{ e " 1 (^ + ' S ) 2- l} e "^ d2 ' S ; 

/3 = x + iy 

= ^ 2 •(§-!) 

- 1 + e-^ 

2 5 l-e-Mi 

(35.3) 

Similarly 


(Ap) 2 = ifir 2 - i 1 + e ' /il 

v ' 2 s 1 - e-^ 1 

(35.4) 
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Equations (35.3/4) conjointly supply 

,2 _ Ax 
? A p 

(which we had occasion to write already at (23) on page 12) and 


Mi = log 


(A pf + kH 




)-Mw) ^ 


(A p ) 2 - 

from which by (35.1) it follows that 
Mo = log (! - e _Al1 ) 


(35.5) 


(35.7) 


Equations (35.7/6/5/2) describe in terms of {xo,Po, Ax, Ap} the values that 
must be ascribed to the parameters {/io, pi, £, 3?(a), 3(a)} if p is to conform to 
the constraints (i)-(u). 

It is natural (and, as will emerge, instructive) to ask: Does 

p = e exp {— eA + : A } (36) 

describe a pure state (trp 2 = 1) or a mixed state (0 < trp 2 < 1)? Adjustment 
of the argument that led to (35.1) supplies 


p 2 = e 2 ^ 0 exp { — 2 pi A+A} = e 2M ° exp {- (1 - e 2Atl )A+: A} 

trp 2 = e 2Mo (l - e -2 ^ 1 ) -1 

= (l-e-w) 2 (l-e- 2 ^) = f ^ Ll) 


Plot f(pi) and see that 


trp 2 is 


< 0 

if 

Mi < 0 

oo 

if 

Mi = 0 

> 1 

if 

Mi > 0 

-> 1 

as 

Mi 


so the only tenable case is the last one: p becomes an admissible density 
operator if and only if pi = oo, which by (35.6) requires Ax A p = }/i, in 
which case p refers to a pure state. 35 


35 Here She & Heffner dropped the ball: they assert without argument (citing 
only §VI of Glauber, 2 though Glauber does not really address the issue) that 
p is positive-definite if and only if 1 — e _Ml ^ 0, which requires p± ^ 0. But 
in those cases p cannot be positive-definite, since trp 2 > 1 becomes consistent 
with trp = 1 only if p possesses negative eigenvalues, as illustrated by the 
following simple example: 


2 0 
0 -1 


trM = 1, trM 2 = 3 > 1 
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If hi = oo then po = 0, e = 1 and (36) assumes the simple form 
p = exp { — A + : A } 

= exp { — (a + — a): (a — a)} (37) 

and the question arises: To what state does p projectively refer? She/Heffner 

proceed from the observation that (by (7.1)) 

( a \p\P) = exp{- (a - a)(/3 - a)} • {a\(3) 

= exp {— a/3 + aa + (3a — |a| 2 } • exp { — ||a| 2 — \\/3\ 2 + a/3} 

= exp {— ||a| 2 — - \a\ 2 + aa + (3 a) 

while 

(a|a)(a|/3) = exp { - f|a| 2 - i|a| 2 + aa} • exp { - i|a| 2 - i|/3| 2 + (3a) 

= exp {- ||a| 2 - \\(3\ 2 - \a\ 2 + aa + (3a] 

So 

{a\p\(3) = (a|a)(a|/3) : all coherent states |a), \(3) 

p=|a)(a| where a = -^(£~ 1 x 0 + i^Po) by (35.2) 

£ = \/Ax/Ap by (35.5) 


Which is to say: The unique (pure) density operator that conforms to the 
constraints (i)-(u) projects onto the coherent state which, as we saw at (29), 
can in ^-representation be described 


^a(a0 



1 



x - 
Ax J 



Alternatively, we might have proceeded 


P a (x,p) -— 

inverse Weyl 
Beck inversion trick 

1pa( x ) 

but I omit those details. 36 


P 


She & Heffner drew inspiration from Jaynes’ “maximal entropy principle,” 
and it was the construction S(p) = — tr(plogp) that supplied the preceding 
discussion with its analytical leverage. It is curious, therefore, that we have not 
had to construct an explicit description of S'(p)—though we are in position to 
do so—and have not demonstrated that S(p) is in fact maximal , though since 
p is unique (the element of a single-member set) it is necessarily so. 


36 See Advanced Quantum Topics (2000), Chapter 2, pages 13-14. 
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But why “coherent”? Possible generalizations. We found that the a-algebra, 
which sprang historically from Dirac’s approach to simple oscillatory theory, 
has a life quite independent of oscillators, and when pushed farther (Glauber, 
Bargmann) than its mechanical application demands gives rise to overcomplete 
eigenstates |a) that are called “coherent states” for reasons that no mechanical 
usage can justify. We found that those same states emerge when one looks for 
the states that reduce Schrodinger’s inequality 


(4.|W^) ; 


quantum correlation coefficient 


(38) 


to equality, and that they might for that reason be called “minimal uncertainty 
states.” They are, moreover, the states prepared by Arthurs/Kelly’s idealized 
“simultaneous {x,p [-measurement device.” And they are the states employed 
by Husimi to heal Wigner quasi-distributions of their regions of negativity. 
Finally, we found —in the She/Heffner work that inspired this essay (again using 
the resources of a-algebra)—that an information-theoretic search for entropy- 
maximizing the density operator p a fter that conforms to specified values of (x), 
(p), (x 2 ) and (p 2 ) leads back again to that same population of states, which 
might on that basis be called “maximal entropy states.” 

So why have the eigenstates |a) of a-operators—for which our experience 
has suggested at least two plausible names—come to be called “coherent states” ? 
A clue is provided by the observation that in the papers of Bargmann (1961/62) 
and Segal (1963) they do not bear that name, while a reference to “coherence” 
appears already in the title (“Coherent and incoherent states of the radiation 
field”) of Glauber’s paper (1963). 2 “Coherence” is a notion that arises in the 
physics of waves (originally optics and acoustics, later “wave mechanics”), 
as a precondition for the occurance of interference effects when two or more 
waves are superimposed (“it takes two to interfere,” two to cohere). It is, 
therefore, a notion irrelevant to the physics of solitary waves, or the mechanics of 
systems with only one moving part (oscillators, for example, or particle-in-a-box 
systems). Quantum optics (bosonic field theory, which is effectively a theory of 
systems of oscillators) is, however, a subject into which correlation/coherence 
enter as essential statistical ideas. It is, I suppose, in the light of this conceptual 
linkage that the states which kill the quantum correlation term in (38) came to 
be called “coherent states.” 

The Schrodinger inequality is “binary” in that it posits the existence of 
two non-commuting operators (A and B: “it takes two to correlate”), speaks 
statistically about a relationship between two experimental numbers: A A and 
A B. By specialization it speaks about cases in which A and B are conjugate: 
[A, B] = ih\. It becomes natural to ask: Can the Arthurs/Kelly simultaneous 
measurement scheme and/or the She/Heffner argument be modified to 
accommodate cases in which the observables, though non-commuting, are not 
conjugate? 37 Noting that while conjugacy is a binary concept, non-commutivity 


37 Shu/Heffner appear to claim so in the title (“Simultaneous measurement of 
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is not, it becomes natural to ask: Can analogs of the Schrodinger inequality, or of 
the Arthurs/Kelly measurement scheme, or of the She/Heffner maximal entropy 
argument... be developed which contemplate the existence of three or more 
non-commutative observables {A, B, C,...}? Is it possible, for example, to 
speak of an optimal simultaneous measurement of all three components of 
angular momentum (spin)? Finally, it would be of interest to describe (which 
we appear to be in position to do without much difficulty) the entropy change 

AS ^post-measurement *5pre-measurement 

brought about by simultaneous measurement processes. 38 


noncommuting observables”) of their paper, but do not really address the issue. 
Note that the a-algebra, of which they make essential use, is itself based upon 
a conjugacy statement: [a, a + ] = I. 

38 Aspects of this topic were addressed (inconclusively) in a Reed College 
physics seminar (“Quantum theory of measurements, and entropy”) presented 
by F. J. Belinfante on 3 April 1985, of which I possess what is, I suspect, the 
only surviving copy of the unpublished text. 



